Hardware
InfiniPot-V: Memory-Constrained KVCache Compression for Streaming Video Understanding
Modern multimodal large language models (MLLMs) can reason over hour-long video, yet their key-value (KV) cache grows linearly with time--quickly exceeding the fixed memory of phones, AR glasses, and edge robots. Prior compression schemes either assume the whole video and user query are available offline or must first build the full cache, so memory still scales with stream length. InfiniPot-V is the first training-free, query-agnostic framework that enforces a hard, lengthindependent memory cap for streaming video understanding. During video encoding it monitors the cache and, once a user-set threshold is reached, runs a lightweight compression pass that (i) removes temporally redundant tokens via Temporal-axis Redundancy (TaR) metric and (ii) keeps semantically significant tokens via Value-Norm (VaN) ranking. Across four open-source MLLMs and four long-video and streaming-video benchmarks, InfiniPot-V cuts peak GPU memory by up to 94%, sustains real-time generation, and matches or surpasses full-cache accuracy--even in multi-turn dialogues. By dissolving the KV cache bottleneck without retraining or query knowledge, InfiniPot-V closes the gap for on-device streaming video assistants.
Get a new PC ready in minutes with this 35 transfer kit
When you purchase through links in our articles, we may earn a small commission. Setting up a new computer usually means hours spent reinstalling programs, hunting down license keys, and figuring out what to do with your old machine. PCmover Professional handles the migration itself, moving your apps, files, and settings from an old PC to a new one, even across different Windows versions. Programs arrive fully installed and ready to use, so there's no need to dig up license keys or reinstall from scratch. Free transfer assistance is available if you get stuck.
Cancel Adobe Acrobat--This PDF editor lifetime license is only 65 during Deal Days
When you purchase through links in our articles, we may earn a small commission. A lifetime license for UPDF, an all-in-one PDF editor for Windows, Mac, iOS, and Android, is on sale for $64.97 (reg. Adobe Acrobat charges a subscription for the privilege of editing a PDF, and the bill never stops. UPDF takes a one-time payment instead and covers most of what Acrobat does. During Deal Days, a lifetime license is only $64.97 (reg.
Analog In-memory Training on General Non-ideal Resistive Elements: The Impact of Response Functions
As the economic and environmental costs of training and deploying large vision or language models increase dramatically, analog in-memory computing (AIMC) emerges as a promising energy-efficient solution. However, the training perspective, especially its training dynamics, is underexplored. In AIMC hardware, the trainable weights are represented by the conductance of resistive elements and updated using consecutive electrical pulses. While the conductance changes by a constant in response to each pulse, in reality, the change is scaled by asymmetric and non-linear response functions, leading to a non-ideal training dynamics. This paper provides a theoretical foundation for gradient-based training on AIMC hardware with nonideal response functions.
AtAtT!T" O!O" Al-to-AlE!E" E# E$ AtT!T" O!O"FFNAtGaT!T" O!O" GaGaE!E" E# E$ Al-to-Al(a(b(clllltetetete))) tetete((DCnnnnDTMoiomttttrsiiiiiaoooopsnnnnansbEttricnihbfe)o)urtmede rMoE
The computational sparsity of Mixture-of-Experts (MoE) models enables sublinear growth in compute cost as model size increases, thus offering a scalable path to training massive neural networks. However, existing implementations suffer from low GPU utilization, significant latency overhead, and a fundamental inability to leverage task locality, primarily due to CPU-managed scheduling, host-initiated communication, and frequent kernel launches. To overcome these limitations, we develop FlashMoE, a fully GPU-resident MoE operator that fuses expert computation and inter-GPU communication into a single persistent GPU kernel. FlashMoE enables fine-grained pipelining of dispatch, compute, and combine phases, eliminating launch overheads and reducing idle gaps. Unlike existing work, FlashMoE obviates bulk-synchronous collectives for one-sided, device-initiated, inter-GPU (R)DMA transfers, thus unlocking payload efficiency, where we eliminate bloated or redundant network payloads in sparsely activated layers. When evaluated on an 8-H100 GPU node with MoE models having up to 128 experts and 16K token sequences, FlashMoE achieves up to 9 higher GPU utilization, 6 lower latency, 5.7 higher throughput, and 4 better overlap efficiency compared to state-of-the-art baselines--despite using FP32 while baselines use FP16. FlashMoE shows that principled GPU kernel-hardware co-design is key to unlocking the performance ceiling of large-scale distributed ML.
MSI MPG 341CQR QD-OLED X36 review: A top-tier ultrawide monitor
When you purchase through links in our articles, we may earn a small commission. The MSI MPG 341CQR isn't quite perfect, but it delivers great image quality and plenty of features at an attractive price. That makes it easy to recommend for both work and play. The MSI MPG 341CQR isn't quite perfect, but it delivers great image quality and plenty of features at an attractive price. That makes it easy to recommend for both work and play. It offers a 5th-generation Samsung QD-OLED panel, a refresh rate of 360Hz, a USB-C port with 98 watts of Power Delivery, and it is VESA DisplayHDR True Black 500 certified.
PC building's weird new reality: Your favorite old parts are back on the menu
PCWorld reports that rising RAM and storage prices are driving hardware vendors to re-release older components like AMD's Ryzen 7 5800X3D and GeForce RTX 30-series cards. This trend matters for PC builders seeking affordable alternatives as memory shortages and cost increases make newer hardware less accessible. The shift encourages enthusiasts to find creative solutions with existing components and embrace the joy of tinkering despite market challenges. Your weekly edition of The Full Nerd has arrived, and there's a new face on the team: mine!
I can't stop using this website that lets me drive through cities around the world
When you purchase through links in our articles, we may earn a small commission. I can't stop using this website that lets me drive through cities around the world Drive and Listen combines street-level video with live local radio for a surprisingly immersive experience--even if it's only from your desktop. Drive and Listen might not sound like it, but it's strangely relaxing once you try it. The site provides exactly the experience the name suggests: Pick a city anywhere in the world, press play, and sit back as you ride along through the streets listening to local radio stations. The site was created during the pandemic by a student in Turkey who missed traveling and wanted a way to reconnect with places beyond his own neighborhood.
The Best Fitness Trackers of 2026: Garmin, Google Fitbit, and More
Find the right wearable for your lifestyle, workouts, and goals. Like every piece of gear you wear on your body day in and day out, fitness trackers are incredibly personal. The right tracker for you should be comfortable, accurate, and tailored to your lifestyle, including your preferred workouts and health goals. Do you bike, row, or strength train? Do you run on trails for hours at a time, or do you just want a reminder to stand up every hour? Do you want to wear it on your wrist or your finger, or tuck it into your sports bra? No matter what your needs are, there's never been a better time to find a powerful, sophisticated tool to help optimize your workouts or jump-start your routine. We test dozens of fitness trackers every year while running, climbing, hiking, or just doing workout videos on our iPads at night, to bring you these picks. For more wearables, check out our guides to the Best Smartwatches, Best Smart Rings, and Best Sleep Trackers . Garmin makes some of the most accurate fitness trackers on the market, and the Vivoactive 6 is the best midrange option for most people.